Boosting for high-dimensional time-to-event data with competing risks

نویسندگان

  • Harald Binder
  • Arthur Allignol
  • Martin Schumacher
  • Jan Beyersmann
چکیده

MOTIVATION For analyzing high-dimensional time-to-event data with competing risks, tailored modeling techniques are required that consider the event of interest and the competing events at the same time, while also dealing with censoring. For low-dimensional settings, proportional hazards models for the subdistribution hazard have been proposed, but an adaptation for high-dimensional settings is missing. In addition, tools for judging the prediction performance of fitted models have to be provided. RESULTS We propose a boosting approach for fitting proportional subdistribution hazards models for high-dimensional data, that can e.g. incorporate a large number of microarray features, while also taking clinical covariates into account. Prediction performance is evaluated using bootstrap.632+ estimates of prediction error curves, adapted for the competing risks setting. This is illustrated with bladder cancer microarray data, where simultaneous consideration of both, the event of interest and competing events, allows for judging the additional predictive power gained from incorporating microarray measurements. AVAILABILITY The proposed boosting approach is implemented in the R package CoxBoost and prediction error estimation in the package peperr, both available from CRAN.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Competing Risks Data Analysis with High-dimensional Covariates: An Application in Bladder Cancer

Analysis of microarray data is associated with the methodological problems of high dimension and small sample size. Various methods have been used for variable selection in high-dimension and small sample size cases with a single survival endpoint. However, little effort has been directed toward addressing competing risks where there is more than one failure risks. This study compared three typ...

متن کامل

Determining factors contributing to the five-year survival of children suffering from acute lymphoblastic leukemia based on tree survival model in the presence of competing risks

Abstract Background and objectives: Leukemia is one of the most prevalent cancers worldwide. The relapse of the disease mitigates patient survival time. The convenience of explaining the results obtained from analyzing tree models have encouraged doctors and paramedics to employ them in their research. The current study is an attempt to determine the five-year survival time and factors influen...

متن کامل

Comparison of Random Survival Forests for Competing Risks and Regression Models in Determining Mortality Risk Factors in Breast Cancer Patients in Mahdieh Center, Hamedan, Iran

Introduction: Breast cancer is one of the most common cancers among women worldwide. Patients with cancer may die due to disease progression or other types of events. These different event types are called competing risks. This study aimed to determine the factors affecting the survival of patients with breast cancer using three different approaches: cause-specific hazards regression, subdistri...

متن کامل

طول عمر بیماران مبتلا به سرطان معده پس از عمل جراحی:تحلیلی بر اساس رقابت جویی خطرات

Background and Aim: Many researchers have studied survival (time to death) of gastric cancer patients. Although gastric cancer diagnosed in early stages can be cured by surgery, chance of relapse still exists after operation. Hence, we should consider both events, that is, relapse of the disease and death, in order to be able to make a more precise estimation for survival of the patients. The p...

متن کامل

Parametric Estimation in a Recurrent Competing Risks Model

A resource-efficient approach to making inferences about the distributional properties of the failure times in a competing risks setting is presented. Efficiency is gained by observing recurrences of the compet- ing risks over a random monitoring period. The resulting model is called the recurrent competing risks model (RCRM) and is coupled with two repair strategies whenever the system fails. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 25 7  شماره 

صفحات  -

تاریخ انتشار 2009